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EXPRESSION OF DNA SEQUENCES DERIVED 
FROM NOCARDIOFORM MICROORGANISMS 
Background of the Invention 
This invention relates to gene expression. 

5 The organisms grouped as Nocardiof orms in 

Bergev ' s Manual of Systematic Bacteriology (Williams and 
Wilkins, 1986, pp. 1458-1481) include nine genera which 
are close phylogenetically to Corvnebacterium, 
Athrobacter and Mycobacterium . Members of the 

10 Nocardiof orms including the genera Nocardia and 

Rhodococcus exhibit a wide range of useful metabolic 
activities including the assimilation of unusual 
compounds such as alkanes and aromatic hydrocarbons; the 
degradation of lignin, detergents and pesticides; the 

15 production of enzymes useful in xenobiotic 

transformations; and the biosynthesis of antibiotics, 
amino acids and biosurf actants . In particular, this 
group of bacteria have been exploited for their 
production' of important steroid modifying enzymes, 

20 including cholesterol esterase and cholesterol oxidase, 
which are used in diagnostic assays to determine 
cholesterol levels in blood and serum. 

The capacity to degrade cholesterol is 
widespread among microorganisms other than the 

25 Nocardiof orms. For example, cholesterol oxidases have 
also been isolated from species of Mycobacterium, 
■ pseudomonas , and Streotomyces . It is well known from 
scientific and patent literature and from commercial 
practice that the Nocardioform-type cholesterol oxidase 

30 is distinct from the other microbial cholesterol 

oxidases . . 

The Nocardioform enzyme is very stable and 
active over a wide pH range (pH 6.0 - 8.0), the Km for 
cholesterol is 1.4 x 10~ 5 mol/liter at 25°, and the 
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enzyme is highly specific for A - or A -313-sterols . 
As produced in the Nocardiof orms , cholesterol oxidase is 
membrane associated and requires detergent extraction 
and lipid removal for purification. 

5 Singer, et al*, j. Bacterid., 1988, Vol. 170, 

pp. 638-645, describes expression of genes in 
Rhodococcus sp. from both coli and Streptomyces sp. 
and the development of a shuttle plasmid based on an E. 
coli plasmid and containing a Rhodococcus 

10 plasmid-cerived origin of replication. The plasmid is 

capable of replicating in coli and in several but not 
all of the Rhodococcus species tested. 

Summary of the Invention 
We have discovered that recombinant DNA 

15 sequences derived from Nocardioform microorganisms can 
be expressed efficiently in a host microorganism. 

The invention features, in one aspect, a 
cloning vector which expresses DNA sequences, derived 
from a Nocardioform organism, in a host microorganism. 

20 In preferred embodiments, the DNA sequence is derived 

from Rhodococcus sp. NCIB 10554 (National Collection of 
Industrial Bacteria, Aberdeen, Scotland); the DNA 
sequence encodes cholesterol oxidase; the organism in 
which the cloning vector can express the Nocardioform 

25 DNA is a gram-positive microorganism, more preferably a 
species of Streptomyces , and most preferably S. 
lividans ; and the cloning vector is the plasmid pSL81.. 

Cholesterol oxidase is produced according to 
the invention by providing a cloning vector containing a 

30 DNA sequence encoding cholesterol oxidase, which is 

derived from a Nocardioform microorganism, transforming 
a host cell with the cloning vector to obtain a 
recombinant host cell, culturing the recombinant host 
cell under conditions* permitting expression of the DNA 
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sequence, and recovering the cholesterol oxidase. In 
preferred embodiments, the Nocardioform organism is 
Rhodococcus sp. NCIB 10554; y the host cell is a 
gram-positive microorganism, preferably a species of 

5 strentomyces , and more preferably lividans ; and the 

cloning vector is the plasmid pSL81. 

Cholesterol oxidase produced according to the 
invention can be used for determining cholesterol in a 
sample, by contacting the sample with the recombinant 

10 cholesterol oxidase and determining the extent of 

oxidation of cholesterol; specifically, by measuring 
oxygen consumption with an oxygen electrode; or by 
measuring the production of hydrogen peroxide; or by 
spectrophotometrically determining cholesterol 

15 conversion to A -cholestenone. 

The invention provides for the expression of 
DNA sequences derived from a Nocardioform microorganism 
in a host microorganism, that is easier to manipulate 
under conditions of commercial production than is the 

20 parent organism itself, and for which cloning techniques 
are known. Recombinant cholesterol oxidase derived from 
Nocardioform microorganisms is expressed as an 
extracellular protein, an expression condition which can 
lead to substantially higher production of enzyme. 

25' Moreover, the expression does not require addition of an 
inducer, and the expressed enzyme is free of the lipids 
of the Nocardioform membrane. 

Other features and advantages of the invention 
will be apparent from the following description of the 

30 preferred embodiment thereof, and from the claims. 

Description of the Preferred Embodiment 
The drawing will first be described. 

Drawing 

Fig. 1 is a restriction endonuclease map of 
plasmid pSL81. 
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Fig. 2 (2.1-2.6) is a sequence description of 
the Rhodococcus 10554 cholesterol oxidase gene. 

Fig. 3 is a restriction endonuclease map of 
plasmid pSLSl, showing the position of the pSL81A15 

5 fragment. * 

Gene Isolation and Expression 
The expression of DNA sequences derived from a 
Nocardioform microorganism can be accomplished, in 
general, by first constructing a genomic bank or library 

I0 by incorporating the DNA sequences into a suitable 
cloning vector containing a suitable origin of 
replication compatible with the desired host organism. 
Next, a host cell from any microorganism, advantageously 
any gram-positive microorganism f such as, for example, 

15 species of Streotomyces or Bacillus , is transformed with 
the vector containing the DNA sequences. Then, 
transformants are screened for the activity of the 
desired gene product, and active clones are isolated and 
characterized. 

20 Construction of Library 

The following protocol, presented for 
illustrative purposes, describes construction of a 
genomic library of Rhodococcus sp. NCIB 10554 
( Rhodococcus 10 554) DNA in a host such as S. lividans 

25 which is capable of expressing the DNA from the parent 
microorganism. In the. example described below, a 
population of transformed cells prepared using this 
library is screened for. the expression of the 
cholesterol oxidase gene of Rhodococcus 10554. 

- 30 Generally/ a genomic library of Rhodococcus 

10554 DNA was constructed from high molecular weight DNA 
• which had been isolated and fragmented with the 
restriction enzyme Sau3A to yield fragments in the size 
range 4-8 kilobase pairs. Size fractionated. DNA from 
Rhodococcus 10554 was then ligated to the cloning 
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plasmid pIJ702 which had been digested with Ball I. The 
pIJ702 plasmid is commercially available through the 
American Type Culture Collection (Rockville, MD, USA). 
Hybrid plasmids of the library were transformed into S^. 

5 lividans cells to generate the genomic library of 
Rhodococcus 10554 DNA in the host microorganism. 

Details of the procedure were as follows: A 
single colony of Rhodococcus 10554 was used to inoculate 
50 ml of NBYE medium which contained per liter: Difco 

10 nutrient broth, 8g, and Difco yeast extract, 5g. Other 
appropriate media described for growing Rhodococcus sp. 
could also be used. The culture was grown for 48 hours 
at 30°C and the cells collected by centrifugation at 
10,000 rpm for 15 minutes in a Sorvall RC-5B centrifuge 

!5 using an SS-34 rotor. The cells were resuspended in 18 
ml of Tris-glucose (lOmM Tris-HCl, pH 8.0, lOmM EDTA, 
25% w/v glucose), and 2 ml of lysozyme solution (10 
mg/ml lysozyme (Sigma) in. Tris-glucose solution) was 
added. This mixture was incubated at 37°C for 60 

20 minutes. Next, 2.4 ml of 10% w/v sodium dodecyl sulfate 
(Sigma) was added and the mixture stirred vigorously to 
form a homogenous solution of lysed cells. The lysed 
cells were incubated at 55°C for 2 hours, and then 2.7 
ml of 5M NaC10 4 (Sigma) were added. Denatured protein 

25 and cell debris were removed by extracting the lysed 
cell solution twice with 20 ml chloroform: isbamyl 
alcohol (24:1) and resolving the phases by 
centrifugation at 12,000 rpm for 10 minutes in a Sorvall 
SS-34 rotor . The aqueous phase was removed and two 

30 volumes of cold ethanol added. The solution was mixed 
gently, and the high molecular weight DNA was collected 
using a glass rod as described by Marmur et al., 1961, 
J. Mol. Biol., Vol. 3, pp. 203-218. The collected DNA 
was washed twice in 70% ethanol and dissolved in 5 ml TE 



(lOmM Tris, ImM EDTA* pH 8.0). DNAse free KNAse (Sigma) 
was added at 100 mg/ml to the DNA solution and incubated 
at 42°C*for 30. minutes. Next, 0.6 ml of 10% w/v of 
sodium dodecyl sulfate and 0.6 ml of 5M NaCl were added, 

5 and the mixture was extracted once with 

phenol: chloroform (1:1) and once with chloroform: isoamyl 
alcohol (24:1). DNA was precipitated with 2 volumes of 
ethanol and collected as before using a glass rod. The 
DNA was washed once with 70% ethanol and air dried prior 

10 to resuspension in TE to a final concentration of 
approximately 1 mg/ml. 

In order to establish the condition for 
isolating partially digested high molecular weight 
Rhodococcus 10554 DNA, test reactions were set up to 

15 digest DNA with different concentrations of Sau 3A 

restriction enzyme. A reaction mixture was prepared 
vith 10 ]xl DNA, 15 \il lOx Sau 3A buffer (200mM Tris 
HC1, pH -7.4, 50mK MgCl 2 , 500mM KC1) and 125 \xl 
water. Aliquots of 15 jal were dispensed into tubes 

20 labeled 1-9 * and chilled on ice. Exactly 4 units of 
Sau3 A enzyme (10 units/jil BRL) were added to tube 1 
and mixed well. Two-fold serial dilutions of the 
reaction mixtures from tube 1 sequentially through to 
tube 8 were performed to yield reaction mixtures 

25 containing two-fold decreasing concentrations of Sau 3A 
enzyme. Tube 9 which contained no enzyme represented 
the undigested control reaction. The tubes were 
incubated at 37°C for i hour and stopped by returning 
the tubes to ice and adding EDTA (20mM), The extent of 

30 digestion in each tube was monitored by agarose gel . 
electrophoresis, and the sample which yielded DNA 
fragments in the approximate range of 2-20 kb was. scaled 
up 200 fold and the .reaction* repeated. The digested 
Rhodococcus 10554- DNA was separated by agarose gel 
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electrophoresis using low gelling temperature agarose 
(SeaPlaque, FMC) . The region. of the gel containing DNA 
fragments in the size range, 4-8 kb was excised with a 
sterile scalpel and the DNA recovered by electro- 

5 elution. The DNA was precipitated by conventional 

methods, and the precipitate was resuspended in TE to a 
concentration of 0.025 mg/ml . 

A single colony of lividans /pIJ702 was used 
to inoculate 50 ml of YEME which contained per liter: 

10 Difco yeast extract, 3g; Difco peptone, 5g; Oxoid malt 
extract, 3g; glucose, lOg; sucrose, 340g; MgCl 2 at 
5mM; and glycine at 0.4% w/v; and was supplemented with 
the plasmid selective agent thiostrepton (CalBiochem) , 
at 50 ng/ml. Other media described for Streptomyces 

15 could also be used. The culture was incubated at 30 °C 
for 3 days. The cells were harvested by centrifugation 
at 10,000 rpm for 30 minutes using a Sorvall centrifuge 
and SS-34 rotor. The pellet was resuspended in 5ml TSE 
Buffer (25mM Tris, pH 8.0; 0.3M sucrose; 25mM EDTA) 

20 containing 10 mg lysozyme. The cells were incubated at 
37°C for 30 minutes. A 3 ml volume of alkaline SDS 
solution (0.3M NaOH, 2% w/v sodium dodecyl sulfate) was 
added, and the mixture was incubated at room temperature 
for 10 minutes and at 70 °C for 10 minutes. The lysed 

25 cell solution was cooled to room temperature and 

extracted with 1 volume of phenol: chloroform (1:1). 
The aqueous phase was removed to a clean tube and 0.3M 
sodiurt) acetate, pH 4.8, and l volume of isopropanol 
added. The DNA was precipitated for 30 minutes at -20°C 

30 and recovered by centrifugation, using a Sorvall SS-34 
rotor at 10,000 rpm for 10 minutes. The pellet was 
resuspended in 2 ml TE and the plasmid DNA purified by 
ultracentrif ugation to equilibrium in a cesium chloride 
density gradi nt according to known techniques. 
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Plasmid pIJ702 prepared as above was 
resuspended in TE to a concentration of 1 mg/ml. 
About 10 pi (10 pg) of " DNA was added to 5 pi lOx 
Bglll buffer (SOOmM Tris HC1, pH 7.4; 50mtt MgCl 2 ; 

5 SOOmM KCl) , 32 pi distilled water and 3 pi of Bglll 
restriction enzyme (10 units/pl, BRL) and incubated 
for 2 hours at 37 a C. The reaction was terminated by one 
extraction with .phenol : chloroform (1:1) and one 
extraction with chloroform: isoamyl alcohol (24:1). The 

10 DNA was precipitated by the addition of one-half volume 
of 7.5M ammonium acetate and 2 volumes of cold ethanol. 
After incubation at -20 °C for at least 2 hours, the DNA 
was collected in an Eppendorf microfuge by 
centrifugation for 10 minutes . The DNA pellet was - 

15 washed once with 70%. ethanol , .dried and resuspended in 
50 pi lOmM Tris HCl , pH 8.0. To dephosphorylate the 
D A f 1 pi of calf ; intestine alkaline phosphatase (2 
U/pl, IB I) was added and the reaction incubated at 
37°C for 30 minutes. The reaction was stopped by adding 

20 150 pi stop buffer (lOmM Tris HCl, pH 7.5; IroM EDTA, 

. pH 7.5; 200mM'NaCl; 0.5% w/v sodium dodecyl sulfate) and 
then extracted once with phenol: chloroform (1:1) and 
once with chloroform: isoamyl alcohol (24:1). The DNA 
was precipitated with ethanol, dried and resuspended in 

25 TE to a final concentration of 0.05 mg/ml .and stored at' 
-20°C, . 

About 10 pi (0.5 pg) of the dephosphorylated 
Bglll digested pIJ70'2 DNA was added to about 20 pi 
(0.5 pg) of the Sau3A digested Rhodococcus DNA. A 
30 20 pi aliquot of 5x ligase buffer (BRL), 0.5 pi 
ligase (NEB) and 50 pi water were added and the 
ligation reaction incubated at 14°C for 20 hours. The 
reaction was terminated by heating the mixture to 65 °C 
for 10 minutes, and "the sample was stored at -20°C- 
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A vegetative inoculum was prepared by growing 
S. lividans in 25 ml culture broth as described for the 
preparation of lividans /pIJ702 , except the medium 
contained no thiostrepton. The cells were harvested in 

5 a benchtop centrifuge at 3,000 rpm for 10 minutes and 
washed twice with 10 ml of 10.3% w/v sucrose. The cell 
pellet was resuspended in 4ml lysozyme medium (L-medium, 
Thompson et al./ 1982, J, Bacteriol., Vol. 151, pp. 
668-677) and incubated at 3Q°C for 30 minutes. A 5 ml 

10 aliquot of protoplast medium (P-medium, Okanishi, et 

al., 1974, J. Gen. Microbiol., Vil. 80, pp. 389-400) was 
added and the solution mixed by pipetting up and down. 
Protoplasts were * filtered through glass wool (which 
traps mycelia but allows protoplasts to pass through) 

15 harvested by centrifugation, and resuspended in 2 ml 

P-medium. Protoplasts were used fresh or stored frozen 
in P-medium at -70 °C. 

Up to 3 yl of ligated plasmid DNA in ligation 
buffer was mixed gently with 50 \xl lividans 

20 protoplasts in a small sterile tube. To this mixture, 
200 ]xl transformation buffer (T-buffer, Thompson, et 
al., 1982, J. Bacteriol,, Vol. 151, pp. 668-667) was 
added and pipetted to mix. Immediately, the cells were 
spread gently onto R2YE medium (Thompson, et al., 1980, 

25 Nature (London), Vol. 286, pp. 525-527) and incubated 
overnight at 30°C. The regenerated protoplasts were 
overlaid with filter paper (Whatman 541, 9.0cm) 
containing 100 jag/ml thiostrepton and incubated at 
30 °C for 3 days. About 200-500 transf ormants per plate 

30 were obtained using these transf ormation ' conditions . 

Approximately 12,000 thiostrepton-resistant lividans 
transf ormants were obtained in this way. Over 70% of 
these transf ormants contained recombinant pIJ702 
plasmids as estimated by miniprep DNA analysis. These 
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recombinant plasmids represented the genomic library of 
Rhodococcus 10554 DNA. 

Screening for Cholesterol Oxidase Activity 

As an example, showing the expression of one 

5 DNA sequence derived from a Nocardioform microorganism, 
recombinant lividans cells containing the genomic 
library of Rhodococcus 10554 DNA were screened for the 
expression. of cholesterol oxidase activity: Cholesterol 
oxidase was determined using a modified method of Allain 

10 et al., 1974 /Physical Cheirw , Vol. 20, pp. 470-475.. 
Filter paper discs were scaked in an assay buffer 
containing 4-aminoantipyrine, phenol, horse radish 
peroxidase and triton-XiOO, and laid down on colonies 
growing on agar medium .containing cholesterol 

15 substrate. Alternatively, these colonies could be 
assayed by halo formation on agar medium containing 
cholesterol, in which halo formation depended on the 
conversion of cholesterol to cholestenone by soluble 
cholesterol oxidase. Recombinant lividans colonies 

20 ■ which produced cholesterol oxidase were purified, and 
characterized. 

Details of the screening of the genomic library 
for cholesterol oxidase activity were as follows: 
Filter papers used to apply thiostrepton drug to R2YE 

25 transformation plates for the selection of lividans 
transformants, were lifted and placed onto screening 
plates [medium containing per liter; agar, 15g; 
glycerol, 20g; yeast extract, lg; MgSO^, 0.66g; buffer 
salts, 66 ml (per liter: Na 2 HP0 4 , 28g; NaH 2 P0 4 , 

30 30g; K 2 HP0 4 , 20g; (NH 4 ) 2 S0 4 , 127,5 g) and 

trace salts, 2 ml (per liter: concentrated HC1, 250 ml; 
CaCl 2 , 3.57g; ZnS0 4 , 20g; CuCl 2 , 0.85g; NaMo0 4 , 
4.8g; MnCl 2 , 2.0g; FeCl 3 , 5.4g; boric acid, 0.3g; 
CoCl 0 > 2.4g)J with cholesterol (1.6roM, Sigma) and 
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thiostrepton (50 \ig/ml) . These replica plates were 
incubated at 30 a C for 3 days. The replica filters were 
removed and replaced with sterile filter paper discs 
soaked in cholesterol oxidase assay reagent (lOOmM 

5 citric acid buffer, pH 6.0; 0.2mM 4-aminoantipyr ine 
(Aldrich); horse radish peroxidase I, 600 units per 
liter; 6.4mM cholesterol; lOmM phenol and 4% w/v Triton 
x 100). The plates were incubated at 30°C, and the 
colonies were screened over a 4 hour period for the 

10 development of a red zone due to cholesterol oxidase 
activity which diffused onto the white filter paper. 
After incubation at 30°C for 4 hours, the assay reagent 
filter papers were removed from the screening plates, 
and the plates were re-incubated at 30°C overnight to 

15 allow for the formation of clear halos associated with 
cholesterol oxidase degradation of cholesterol. From 
the approximately 8,000 recombinant lividans colonies 
screened, two showed cholesterol oxidase activity. The 
plasmid DNA from one of these isolates was called pSL81 

20 and was analyzed further. 

Plasmid DNA from the lividans /pSL81 
recombinant was isolated and transformed back into 
lividans using the same procedure described for 
isolating and transforming pIJ702. All the 

25 thiostrepton-resistant transformants obtained contained 
the same plasmid pIJ702 with approximately 6 kb insert 
DNA, and showed the cholesterol oxidase activity 
diagnostic of the original recombinant. Curing of 
plasmid pSL81 from lividans cells by growth in the 

30 absence of drug selective pressure resulted in the 
simultaneous loss of thiostrepton resistance and 
cholesterol oxidase activity. Southern hybridization 
demonstrated that the insert DNA was derived from the 
Rhodococcus 10554 genome. Referring to Fig. 1, the 
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recombinant plasmid designated pSL81 includes cloning 
vector pIJ702 12 and insert DNA 14 from Rhodococcus . 
10554/ and has restriction endonuclease sites as 
indicated. 

5 Plasmid pSL81,. prepared as above, was used to 

probe chromosomal digests \ from the cholesterol oxidase 
producing Nocardioforms Rhodococcus UCIB 10554, R. 
ervthropolis NCIB 9158, Nocardia ervthropolis ATCC 17895 
and Nocardia erythropolis ATCG 4277, The strains all 

10 showed cross-hybridizing DNA .bands suggesting that the 
cholesterol oxidase coding region in these Nocardioforms 
is the same. Plasmid pSLSl showed no specific cross 
hybridization with DNA isolated from a cholesterol 
oxidase producing Streptomyces , namely Streptomyces 

15 violascene NRRL B-2700. 
. UNA Sequencing 

In order to sequence the cholesterol oxidase 
gene, subclones of pSLSl were constructed and a 
cholesterol oxidase coding region was localized to a 

20 2.8 kb Bcrl ll fragment in the subclone pSL81A15 

(Fig. 3). This fragment was then introduced into 
coli on plasmid pBR322, and a series of deletion 
derivatives were constructed using available restriction 
sites. Sequencing was performed on denatured double 

25 stranded plasmid DNA using the dideoxy chain termination 
method of Sanger et al . , PNAS USA, Vol. 74, pp. 
54.63-67. Specifically, a Sequenase Version 2.0 kit 
(United States Biochemical Corp. ) was used according to 
the manufacturer's specifications. Deoxyguanidine 

30 triphosphate or Deoxy-7-deazqguanidine triphosphate was 
used to terminate chain polymerization reactions 
depending on the degree, of G+C content of a particular 
region .of DNA being sequenced. The primer extended 
oligonucleotides were electrophoresed on denaturing 6% 
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polyacrylamide and the DNA sequences of adjacent DNA 
fragments were pieced together to construct a complete 
DNA sequence of the cholesterol oxidase gene. The 
sequence is shown in Fig. 2. 

5 Subcloning of the Rhodococcus cholesterol 

oxidase gene in pSL8lA15 can result in an increase in 
expression of the cholesterol oxidase protein. 
Recombinant lividans cells were grown on RZYE agar 
containing 50pg/ml thiostrepton. After 2-3 days 

10 incubation at 30°C, the mycelia and spores were scraped 
from the surface of the agar and used to inoculate 50 ml 
YEME +12.5 \ig/ml thiostrepton seed cultures. These 
cultures were grown at 30 °C with vigorous shaking and 
after 48 h, 2 1 fermenters containing YEME +12.5 

15 pg/tnl thiostrepton were inoculated with the seed. The 
fermentations were controlled at 28°C and pH 7.0 with 
aeration at 0.5 1/min and agitation at 1000 rpm. 
Cholesterol oxidase activity peaked after 48 h and in 
some such cultures expression was substantially greater 

20 in the 1 iv idans /pSL8 1 A 1 5 fermentations than in the 
S . lividans /pSL81 fermentations. 
Protein characterization 

Recombinant cholesterol oxidase was isolated 
from 3 day old cultures of lividans /pSL81 cells grown 

25 in YEME (25 ml) containing thiostrepton (50 |jg/ml). 

Non-recombinant cholesterol oxidase was isolated from 48 
hour Rhodococcus 10554 cultures grown in liquid 
production medium which was the same medium as that used 
for the screening plates not solidified with agar. 

30 Induction experiments were performed using the same 
growth conditions in the presence of "sterol as an 
inducer. Intracellular and extracellular fractions were 
recovered by centrifugation and assayed for cholesterol 
oxidase activity using a cholesterol oxidase specific 
diagnostic assay. The specificity of the recombinant 



cholesterol oxidase from lividans and the purified 
cholesterol oxidase from Rhodococcus 10554 were compared 
using various sterol substrates. Both enzymes shoved 
similar profiles with the order of activity being 

5 cholesterol, pregnenolone, dihydroxycholesterol, 
stigraasterol and ergosterol. 

Recombinant cholesterol oxidase from 
S. lividans /pSLSl was functional in standard diagnostic 
assays for the determination of cholesterol. An aliquot 

10 of cell-free supernatant from a 3 day old culture of 
YEME grown recombinant cells was mixed with the 
cholesterol oxidase assay reagent (1.4 ml of 0,33 mg/ml 
4-aminophenazone in 0.1M phosphate buffer, pH 7.0, 0.05% 
Triton X-100 (w/v) ; 1,4 ml of 1 mg/ml phenol in 

i5 phosphate-Triton X buffer; 0,1 ml of 4mM cholesterol; 
and 0.01 ml of 10 mg/ml peroxidase) and incubated at 
30 °C. Hydrogen peroxide formed during the oxidation of 
cholesterol and chelated with the aminoantipyrine in the 
assay mixture was determined by measuring the absorption 

20 of the red complex at 505nm versus a reagent blank 

containing water. The rate of cholesterol degradation 

by the recombinant enzyme was also measured directly by 

following the increase in absorbance at 240 nm owing to 
4 

A -cholestenone production. To 2.9 ml of sodium 
25 phosphate buffer (0.1M, pH 7.0) in a cuvette, 100 jil 
of 3% Triton X-100 (w/v) and 50 jil of a 6mM 
cholesterol solution were added. Diluted enzyme at 
approximately 0.2 U/ml in buffer was added and the 
absorbance at 240nm monitored using a Beckman 
30 spectrophotometer. Cholesterol oxidase activity was 
calculated as: 



iinits/mg - OD/min x RV 

. .12.3 x DF x SV x PC (mg/ml), 
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where 

RV = reaction volume 

DF = dilution factor 

* 

SV = sample volume 
5 PC = protein concentration 

Further characterization of the recombinant cholesterol 
oxidase from lividans /pSL81 demonstrated that the 
enzyme could also be used to determine cholesterol in 
sera using the peroxidase coupled reaction, 

10 Recombinant cholesterol oxidase from 

lividans /pSL81 and purified Rhodococcus 10554 
cholesterol oxidase were analyzed by polyacrylamide gel 
electrophoresis using a 10% polyacrylamide gel and the 
buffer system of Laemelli, 1970, Nature, Vol. 227, pp. 

15 680 et sea. Both enzymes were determined to be 

approximately 55 kD in size when compared with Biorad 
molecular weight standards. 
Peptide sequencing 

Recombinant cholesterol oxidase from 

20 lividans /pSL81A15 and non-recombinant cholesterol 

oxidase from R^ rhodococcus were isolated as single 
bands from a polyacrylamide gel, and the amino-terminal 
peptide sequences were determined using a protein 
sequenator. Two peptide sequences were recovered from 

25 the recombinant protein band: 

NH 2 -Gly-Gly-Pro-Val-Ser-Thr-Leu-Thr-Pro-Pro-Pro-Ala-Phe- 

and 

NH 2 -Gly-Pro-Val-Ser-Thr-Leu-Thr-Pro-Pro-Pro-Ala-Phe- 
These peptide sequences are identical to those deduced 
30 directly from the DNA sequence between positions 280 and 
320, and between positions 283 and 320, respectively. 
The elimination of a glycine residue at the N-terminus 
of one recombinant sequence is probably owing to a 



difference in post-trans lational processing by the host 
cell. 

Amino terminal sequencing of cholesterol 
oxidase from R^ rhodococcus identified the sequence 
NH 2 -Thr-Pro-Pro-Prb-Ala-Phe-Pro-Giu-Gly-Ile"Ala-Leu-Tyr- 

Gln-Gln-, 

which .corresponds to the peptide sequence deduced from 
the DNA sequence between positions 300 and 346. 
Therefore r the enzyme found in the 

S. lividans /pSL81&15 culture medium is either 6 or 7 
amino acids longer that the membrane bound enzyme found 
in R. rhodococcus . 

Processing of gram-positive secreted proteins 
often involves signal sequence cleavage followed by 
15 hydrolysis at a- secondary processing site.. In the case 
of native cholesterol oxidase this secondary site is at 
the threonine. Apparently such a secondary cleavage 
does not occur when the protein is synthesized and 
secreted extracellularly by lividans / 
20 Other Embodiments 

Other embodiments are within the following 
claims.. For. example , other species of any genus of the 
. Nocardioforms including Rhodococcus . NCI B 10554, R. 
erythropolis NCIB 9158, Nocardia ervthropolis ATCC 17895 
25 ' and Nocardia erythropolis ATCC 4277 could be used to 

prepare the genomic library \ or the libraries could be 
screened for other useful metabolic activities such as 
the assimilation of alkanes and aromatic hydrocarbons; 
the degradation of lignin, detergents, and pesticides; 
30 the production of enzymes useful in xenobiotic 

transformation; the biosynthesis" of antibiotics, amino 
acids and biosurf actants; and the production of other 
steroid modifying enzymes. such as cholesterol esterase. 

As other Strer>tomvces and also Bacillus species 
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share the advantages of lividans for the expression 
of heterologous genes, they could also serve as suitable 
hosts for the expression of DNA sequences derived from 
Nocardioform microorganisms. Other bacteria belonging 

5 to the high G+C subdivision of the gram-positive 
bacteria including any of the Nocardiof orms, 
Mycobacterium , Corvnebacterium , and Athrobacter would be 
suitable hosts for cloning DNA sequences derived from 
Nocardioform microorganisms. 

10 Sj. lividans /pSL8l can undergo rearrangement in 

one or more regions outside the cholesterol oxidase 
coding region and can retain cholesterol oxidase 
transforming activity. One such lividans/pSL81 
rearranged plasmid, has at least 3.6 kb of the 6 kb 

l5 Rhodococcus DNA. unchanged as determined by restriction 
mapping. In this rearranged lividans/pSLS 1 plasmid a 
DNA insertion of approximately 2.5 kb has occurred in 
the region to the right. of the PyuII site. Cholesterol 
oxidase was recoverable from cultures of lividans 

20 transformed with this rearranged plasmid,. in lower 
quantities than from lividans /pSL81 cultures. 

In still other embodiments expression of 
cholesterol oxidase may be increased by further 
manipulating the pSL'81A15 clone, as will be apparent 

25 to one skilled in the art. F.or example, a higher copy 
number vector could be used, such as, for example, 
pIJ303, described in Hopwood et al . , 1985, Genetic 
Manipulation of Streptomyces : A Laboratory Manual . Gene 
expression could also be enhanced by replacing the 

30 cholesterol oxidase promoter with a stronger exogenous 
promoters, such as ErmE promoter, described in Bibb et 
al., 1985, Gene, Vol. 38, pp. 215-226; aph promoter, 
described in Hopwood et al., 1985; or a tac promoter, 



. described in Amman et al., 1984, Gene, Vol. 25, pp. 167 
et seer. 

Deposit 

Under the terms of the Budapest Treaty on the 

5 International Recognition of the Deposit of 

Microorganisms for the Purpose of Patent Procedure, 
deposit of plasmid pSL81 has been made with the American 
Type Culture Collection (ATCC) of Rockville, MD, USA, 
where the deposit was given Accession Number . 

•10 Applicants 1 assignee, Genzyme Corporation, 

represents that the ATCC is a depository affording 
permanence of the deposit and ready accessibility 
thereto by the public if a patent is granted. All 
restrictions on the availability to the public of the 

15 material so deposited will be irrevocably removed upon 
the granting of a patent. The material will be 
available during the pendency of the patent application 
to one determined by the Commissioner to be entitled 
thereto under 37 CFR 1.14 and 35 U.S.C, S 122. The 

20 deposited material will be maintained with all the care 
necessary to keep it viable and uncontaminated for a 
period of at least five years after the most recent 
request for the furnishing of a sample of the deposited 
plasmid, and in any case, for a period of at least 

25 thirty (30) years after the date. of deposit or for the 
enforceable life of the patent, whichever period is 
longer. Applicants' assignee acknowledges its duty to 
replace the deposit should the depository be unable to 
furnish a sample when requested due to the condition of 

30 the deposit. 
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Claims 

1 1. A cloning vector comprising a DNA sequence 

2 derived from a Nocardiof orm microorganism, said DNA 

3 sequence encoding a protein, and said cloning vector 

4 being capable of expressing said DNA sequence in a host 

5 microorganism. 

1 2. The cloning vector of claim 1 wherein said 

2 Nocardioform microorganism is Rhodococcus s£. NCIB 10554. 

1 3. The cloning vector of claim 1 wherein said 

2 host microorganism is a gram-positive microorganism. 

1 4. The cloning vector of claim 1 wherein said 

2 host microorganism is a species of Streptomvces . 

1 5. The cloning vector of claim 1 wherein, said 

2 host microorganism is S^. lividans . 

1 6. The cloning vector of claim 1 wherein said 

2 DNA sequence encodes cholesterol oxidase. 
1 7. The plasmid pSL81. 

1 8. A microorganism transformed with the 

2 cloning vector of claim 1 or 2. 

1 9. A gram-positive microorganism transformed 

2 with the cloning vector of claim 3. 

1 10. . A cell of a species of Streptomvces 

2 transformed with the cloning vector of claim 4. 

1 11. A cell of the species S^. lividans 

2 transformed with the cloning vector of claim 5. 

1 12. A method for expressing a DNA sequence 

2 derived from a Nocardioform microorganism comprising 

3 providing a cloning vector containing said DNA 

4 sequence , 

5 transforming a host microbial cell with said 

6 cloning vector to obtain a recombinant host cell, and 

7 culturing said recombinant host cell under 

8 conditions permitting expression of said DNA sequence. 
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1 .13, A method for producing cholesterol oxidase 

2 comprising 

3 providing a cloning vector containing a DNA 

4 sequence derived from a Nocardioform microorganism, said 

5 DNA sequence encoding cholesterol oxidase, 

6 transforming. a host cell with said cloning 

7 vector to obtain a recombinant host cell, 

8 culturing said recombinant host cell under 

9 conditions permitting expression of said DNA sequence, 

10 and 

11 recovering the cholesterol oxidase. 

1 .14. The -method of claim 13 wherein said 

2 Nocardioform organism is Rhodococcus sp. NCIB 10554. 

1 15. The method of. claim 13 wherein said host 

2 cell is a gram-positive microorganism. 

1 16. The method of claim 13 wherein said host 

2 cell is a Streptomyces sp- 

1 17. The method of claim 13 wherein said host 

2 cell is lividans . . 

1 18, The method of claim 13 wherein said 

2 cloning, vector is plasmid pSL81. 

1 19. Recombinant cholesterol oxidase encoded by 

2 DNA derived from a Nocardioform microorganism. 

1 20. Recombinant cholesterol oxidase encoded by 

2 DNA derived from Rhodococcus sp. NCIB 1Q554. 

1 .21. Cholesterol oxidase of 4 claim. 19 or 20, 

2 essentially free of lipids found in said Nocardioform 

3 microorganism. 

1 .22. Cholesterol oxidase of claim 19 or 20 

.2 produced in a transformed heterologous host 

3 microorganism. 

1 23. Cholesterol oxidase of claim 19 or 20 

2 produced in a transformed host gram-positive 

3 microorganism. . 



- 21 - 



1 24. Cholesterol oxidase of claim 19 or 20 

2 produced in a transformed species of Streptomyces . 

1 25. Cholesterol oxidase of claim 19 or 20 

2 produced in transformed S. lividans . 

1 26. Cholesterol oxidase of claim 19 or 20 

2 produced substantially in extracellular form. 

1 27. Cholesterol oxidase of claim 19 or 20 

2 expressed without the addition of inducer. 

1 28. A cell transformed with a DNA sequence 

2 derived from a Nocardioform microorganism/ said DNA 

3 sequence encoding cholesterol oxidase. 

1 29. A method for the determination of 

2 cholesterol in a sample of fluid from a human comprising 

3 contacting said sample with recombinant 

4 cholesterol oxidase encoded by DNA derived from a 

5 Nocardioform microorganism and 

6 determining the extent of oxidation of 

7 cholesterol. 

1 30. The method of claim 30 wherein determining 

2 the extent of oxidation of cholesterol comprises 

3 measuring the consumption of oxygen with an 

4 oxygen electrode. 

1 31. The method of claim 30 wherein determining 

2 the extent of oxidation of cholesterol comprises 

3 measuring the production of hydrogen peroxide. 

1 32. The method of claim 30 wherein determining 

2 the extent of oxidation of cholesterol comprises 

3 spectrophotometrically determining the 

4 conversion of cholesterol to A -cholestenone . 

1 33. The cloning vector of claim 1, wherein 

2 said DNA sequence encodes an amino acid sequence 

3 comprising the sequence: 

4 Thr Pro Pro Pro Ala Phe Pro Glu Gly He Ala Leu 

5 Tyr Gin Gin. 
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1 34. The cloning vector of claim 33^ wherein 

2 said DNA sequence encodes an amino acid sequence 

3 comprising the sequence: 

4 Gly Pro Val Ser Thr Leu' Thr Pro Pro Pro Ala Phe 

5 Pro Glu Gly lie Ala Leu Tyr .Gin Gin, 

1 35. The cloning vector of claim 34 , wherein 

2 said DNA sequence/encodes ah amino acid sequence 

3 comprising the sequence: 

4 Gly Gly Pro Val Ser Thr Leu Thr Pro Pro Pro Ala 

5 Phe Pro Glu Gly lie Ala Leu Tyr Gin Gin. 

1 36* .The cloning vector of claim 35, wherein 

2 said DNA sequence encodes an amino acid sequence / 

3 comprising -the sequence: 

4 . Met Thr Ala Gin Asp Glu Lys Phe Arg Leu Ser Arg 



5 Arg Gly Phe Met Ala Ala Gly Ala Gly Ala Val Ala Ala Thr 

6 Ala Phe Ala Gly Trp Thr Pro Ala Tyr Ala Val Pro Ala Gly 

7 Ser Ser Gly Ser . Ala Gly Gly Pro Val Ser Thr Leu Thr Pro 

8 Pro Pro Ala Phe Pro Glu Gly* lie Ala Leu Tyr Gin Gin Ala 

9 Tyr Gin Asn Trp Ser Lys Glu lie Met Leu Asp. 

1 37* The cloning vector of claim 1, wherein 

2 said DNA sequence comprises the sequence: 

3 ACA CCG CCG CCC GCC TTC CCC GAA GGC ATC GCG CTG TAC CAG 

4 CAG 

1 38, The cloning vector of claim 37, wherein 

2 .said DNA sequence comprises the sequence: 

3 GGT CCT GTC TCC ACC CTC ACA CCG CCG CCC GCC TTC CCC GAA 

4 GGC ATC GCG CTG TAC CAG CAG 

1 39. The cloning vector of claim 38,. wherein 

2 said DNA sequence comprises the sequence: 

3 GGT. GGT CCT GTC TCC ACC CTC ACA CCG CCG CCC GCC TTC CCC 

4 GAA GGC ATC GCG CTG TAC CAG CAG 



1 40. The cloning vector of claim 39, wherein 

2 said DNA sequence comprises the sequence: 

3 CGAAG ATG ACG GCA CAA GAC GAA AAG TTC CGA CTG TCC CGA 

4 CGA GGT TTC ATG GCC GCT GGA GCC GGC GCC GTG GCA GCG ACC 

5 GCA TTC GCC GGC TGG ACG CCG GCC TAC GCC GTC CCC GCC GGC 

6 TCC TCC GGC TCC GCG GGT GGT CCT GTC TCC ACC CTC ACA CCG 

7 CCG CCC GCC TTC CCC GAA GGC ATC GCG CTG TAC CAG CAG GCA 

8 TAT CAG AAC TGG TCC AAA GAG ATC ATG CTC GAC 

1 41. The cloning vector of claim 40, wherein 

2 said DNA sequence comprises the sequence: 

3 GGGACTCCTG ATCTCAGCTT CCGTACTGGA GCGCGAAGCT CCTGCCCTGG 

4 CTGACGTAGT TCTCACTCTT GTCTGATACC AACCTGTCTG ATACCCACCT 

5 GTTAGAACTC ACCGTAGTTC TCGAACCCGA TGGAGTAGCC CGAAG ATG 

6 ACG GCA CAA GAC GAA AAG TTC CGA CTG TCC CGA CGA GGT TTC 

7 ATG GCC GCT GGA GCC GGC GCC GTG GCA GCG ACC GCA TTC GCC 

8 GGC TGG ACG CCG GCC TAC GCC GTC CCC GCC GGC TCC TCC GGC 

9 TCC GCG GGT GGT CCT GTC TCC ACC CTC ACA CCG CCG CCC GCC 

10 TTC CCC GAA GGC ATC GCG CTG TAC CAG CAG GCA TAT CAG AAC 

11 TGG TCC AAA GAG ATC ATG CTC GAC 

1 42. The cloning vector of claim 1 wherein said 

2 Nocardioform microorganism is ervthropolis NCIB 9158. 

1 43. The cloning vector of claim 1 wherein said 

2 Nocardioform microorganism is Nocardia ervthropolis ATCC 

3 17895. 

1 44. The cloning vector of claim 1 wherein said 

2 Nocardioform microorganism is Nocardia ervthropolis ATCC 

3 4277. 

1 45. The method of claim 13 wherein said 

2 cloning vector is plasmid pIJ303. 

1 46. The vector of claim 1, further comprising 

2 an ErmE promoter. 

1 47. The vector of claim 1, further comprising 

2 an aph promoter. 

1 48. The vector of claim 1, further comprising 

2 a tac promoter . 



Translation of CD2 ouer region 146-2029;. 



10 20 30 40 50 60 70 



GGGRCTCCTG RTCTCRGCTT CCQTfiCTGGfl GCCCCHflGCT CCTGCCCT66 CTGflCGTHCT TCTCBCTCTT 
CCCTGRGGRC TRGRGTCGRR GGCBTGBCCT CGCGCTTCM GGBCGGGRCC GHCTGCflTCH flGHGTGHGflfl 

00 90 100 110 120 130 140 



GTCTSRTRCC RBCCTOTCTO RTRCCCRCCT GTTRGBBCTC RCCGTHGTTC ' TCGBBCCCGfl TGGR0TR6CC 
CRGflCTfiTGG TTGGHCHGRC THTGGGTGGH CIMTCTTGflG TGGCRTCflHG BGCTTGGGCT RCCTCRTCGG 

ISO 160 170 180 190 200 

COflRG RT6 RCS OCR CRfl GflC GRA RRO TTC CGfl CTG TCC CGR CGfl GOT TTC RT6 GCC GCT GGfl 
GCTTC TRC TGC CST GTT CTG CTT TTC RRO GCT GflC RGG GCT GCT CCR RR6 TRC CGG CGR CCT 
Nflt Thr HI a 61 n R*p 61 u Lys Phi Rrg Leu Ser Rrg flrg Ely Phe Ntt Ria file Gly+ 

" " » m m m PtlTnTBBM<t ATlffJ Of C02 8 fl fl 0 0 S_+ 

210 i 220 230 240 250 260 

• • • * * * • m . • m * * . 



GCC GGC GCC GTG GCR GCG RCC OCR TTC GCC GGC TGG RCG CCG GCC TRC GCC 6TC CCC GCC 
CGG CCG CSS CRC CST CGC T66 CCT RES CGG CCG RCC TGC GGC CGG RTO CGG CRG GGG CGG 

Rla Gly Rio Hal Rla Ria Thr Bio Pho HI a Gly Trp Thr pro Ria Tyr flia ual Pro Rio* 
QiiTnTpnMCtnTiiw Qf C02—B a n n n — a — B_+ 



270 280 290 300 310 320 

• * mm* mm m m * m m 

GGC TCC TCC GGC TCC GCG G6T GOT CCT GTC TCC RCC CTC RCR CCG CCG CCC GCC TTC CCC 

CCG RGG RGG CCG RGG CGC CCR CCR GGfl CRG RGG TGG GRG TOT GGC GGC GGG CGG RRG GGG 

Gly ser ser Gly ser Rla Gly Gly Pro Vol Ser Thr Leu Thr Pro Pre Pro Bio Phe Pre* * 

" - « « » « mifrrranMgi htiim qf C02-J a a 0— _n 0 a -» 



330 340 350 360 370 360 

* « mm m mm m m m m m 

CRfl GGC GTC GCG CTG TRC CRG CRG GCR TRT CRG RflC TGG TCC RflR GRG RTC RTG CTC GRC 

CTT CCG TAG CGC GRC RTG GTC GTC CGT HTR GTC TTG RCC RGG TTT CTC TflG TRC GBG CTG 

Giu Gly lie Rla Leu Tyr Gin Gin Rla Tyr Gin R*n Trp Ser Lyt Giu He h*t leu fttp* 

« otiTnTftwjci btiik of C02_o — a — a — a — a — a — a — ■» 



FIG. 2.1 



3M 400 410 420 430 440 

CCC BTC TO RCC TOT TC8 CCC BBS BCG CCC Gflfl SRC 0TB "OTC OCC TCO C6B ACT 606 CCB 
COC T86 BCC TOO RCB BOT 000 TTC TEC COO CTT CTO CAT CBO COO BOC OCT T6B CCC GOT 
Ala lie Trp Thr cut sef Pro Lys Thr pro Oiu Btp Uai Ual Bio sir Bra Thr CI y Pro* 
n « ■ « » « HiiTnTBBWs l ht i oM OF COZ_a — a — a — o — n — a— a — * 



450 460 470 480 490 500 

TCC BBC OSC TBC BCC BTT COT CCC C6C GBQ CCB TEC OTO OBC OCC OCT BBC CAT COT Cflfl 
BOG no CCG BTO TOO TAB OCA 000 OCG CCC GOT BCO CBC CTO COO C6A CTO GTB CCB GTT 
Sor Btn Gtu Tur Thr 1 1 a Brg Pro Bro Gig pro Cut Ual Btp flla flit fitp Hit flrg Gin* 
qtiTnTnuMstnTioM OF C02-0 — n n n n — a — a — * 



510 320 530 540 550 560 

„,,**•«***** 

COO TCC OCC GOT CGB CUB GGT CBT CCT CGC COB CBC CBC EST CCB CCT CBC CGO COT CTC 
CCC BCO COS CCB GET GTT CCB OTB OOB CCG GCT GTQ GTG CCB GOT GOB OTO CCC CCB GRG 

Bro Cut Blo Olu Bra 01 n 01 y Hit Pro Brg Brg Hit Hit 01 y Pro Pro Hit Brg Brg Lou* 
- iwrnnMNSt BTinsi OF C02_a — o 0 o — a a — a — * 



570 900 590 600 610 620 

CBT CUB COC COG TGG CBO CCC OCC CBC COT CBC CGC BGG BCC CGB CGC OBC CCT CGB CGC 
GCft GTT OCG 6CC BCC 6TC BOO CCQ OTO OCB BTG GCG TCC TGO OCC OCO CTO GGB GCT GCG 

Bra Oln Brg Brg Trp Gin Pro Clu His Brg Hit Brg flrg Thr. Brg Brg Btp Pro Brg Brg* 
mtTfiTflfiSKi BTinM OF CD2_fl 0 n a — Q ft — ft — * 



630 640 ' 650 660 670 600 

■ •••*••••••* 

CAT CBC CBC CCC ACT GCB OOC BCB BGO CCT CG6 OTT COC CBB CTO CCG GCG CCC GOT OTG 
OTB 6T0 6TG OCO TOR COT CCG TOT CCC GOB OCC CBB OCO CTT BBC GGC COC 000 CCB CBC 
Hit Hit Hit flrg Thr flla Biy Tnr Big Pro Arg Vol flrg Olu Leu Pro Ale Pro Gig Ual* 

miTfmMMSi bthn OF C02-0 — a — a — o — o — a — a — * 



690 700 710 720 730 740 

TTG 8CC BTC OCC OOC TGC CTC CCC GTC OBC GCT CBC GGT GCB GCG CTC CCC GCC Gflfl GGC 
BBC TGO TAO CGG CCG BCG GflO COO CBB CTO COB GTG CCB COT CGC 6B6 GGG CGO CTT CCG 
Leu Thr He flla Giu cut Leu flla Ual flsp Ala Hit Olu f»« mo Leu Pro flla Olu eiu* 

«• . ■ ■ ■ « a ilTBTBflNSLBTlOM OF C02-0 0 0 8 8 0 a * 



750 760 770 780 790 800 

GBB GCB CBC OTT CCC OOB CBG BCT TTC GGC TCB CTC TCC BBC CTC GTC BCC TCC CTD HCC 
CTT CGT OTO Cflfl GOG CCT GTC TOB BRO CCG BGT GBG BGG TTO DBG CBG TGO AGO OBC TGB 
Glu Blo Hit ual Pro Giu Oln Thr Pho Gly Sor Leu Sar Btn Lw Uai Thr Ser Leu Thr* 

flMTOTBANSLBTIOM OF C02-0 8 8 B fi 8 8 * 
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810 820 650 

• * » * * ■ • 



B40 850 860 

• a • « 



60) GT8 OTC TOO BBC GSC ROT OHO TflC SCO CTC GAG BCC TBC OCO COT ROC CRT OCR GCG 
COT CRC CflO flCC TTO CC6 TCR CTC RTB CGC GAG CTC TOC BTG CGC OCR TCG CTfl CGT CGC 
nis Hoi Del Trp ftsn 8lu Sir Olu Tyr flla Leu Glu Thr Tyr Rio Rrg Ser Rip Rio Al«+ 
« " » » * » QtiTnTBCMciRTinu of C02_a 0 a a a a__a_-> 



670 880 090 900 910 920 

***»• •**«»*« 

RTC ARC CCO CTB CTG RCT CHC CTC BOfl CGC BCC TTC CTC RCC TEC GTG RCC TTO CR6 CCC 
TR6 HC 60C CRC SRC TGfl OTO ORG CCT GCG T68 RB8 ORG TOO BGO CRC TOO ARC 6TC COO 
lie Ly» Pro Ltu Ltu Thr Hit Leu Glu Bra Thr Phe Lou Thr Sir vol Thr Lou sin fll«r» 

« » m m m m flllTnTflBMCI HTIfXl QF C02_fl B 8 B 8 fl Q— 



930 940 050 960 970 980 

OCT CCC RHC TBC COC RTO COC TOC CTC RGC CRC RCC GflC RTC' GOT TOO CRO ORB CTC TTC 

CGR 606 TTG RTG GCG TBC OCO RCO CRO TCG GTG TOG CTG TAG CCR RCC GTC CTT ORG RRG 

flla Pro Rtn Tyr Rrg Nttt Rrg Cyt Uol Ser Hit Thr top lie 61 u Tn» Bin Olu Leu Phr* 

a otiTnrnBMti qtihm of C02_b n n n a « « ■» 



980 1000 1010 1020 1050 1040 

****■» *»*'••* * 

6GC 6CC CGC GGR GCO TCC 06B CCC RCC TTC ORO RRG TTC GTC COC GBR ARC GGT CGC GOT 
CCO COG GCG CCT CGC RGG CCT GCG T69 RRO CTC HC RRG CRO OCO CTT TTG CCR GCG CGT 

oiy hi a Rrg Gly Rio ser Gig flrg thr pne giu lus Pho vol Rrg Glu Rtn oiy Rrg file* 
« « » «' » » aiiTnTBm>jci htioj of C02LO a a a a a — a •» 



1050 1060 1070 1080 1090 1100 

ORB GCR RTC TOG TBC CCC TTC RCC Ofll) CGC CCG TGG RTG RRG GTG TOO TCR CTT GCC CCC 
CTT COT TAG RCC RTG GOO RBO. TGG ITT GCG GGC RCC TBC TTC CRC RCC RGT 6RR CCO C6G 
Olu flla (to Trp Tyr Pro Pho Thr Glu Rrg Pro Trp Nfet Lyt uai Trp Ser Leu at a Pro-* 

HTflTBHMt! HTIfM 0 F 002-0—8 8 0 0—8 D * 



1110 1120 1130 1140 1150 1160 

• a » * * m » * * • » m m 

RCC RRG CGS CCG TTC TCG CGT GRG GTG RCC 6GG CCC TBC ARC TBC RTC TTC TCC GflC ARC 
TGG TTC GCC GGC RAG RGC OCR CTC CAC TGG CCC G6G ATG HO RTG TAG RRG RGG CTO TTG 
Thr Lyt Rrg Pro Phe ser Rrg Glu vol Thr Gly Pro Tyr Rtn Tyr lie Pha Ser R*p ten* 
» « « " ■ « nnTtrrBiaMcioTinsj OF C02_a — a — a a— a — a — a — * 



1170 1100 1100 1200 1210 1220 

• • • • mm mm** • 

CTC CCS GRG CCG OTC RCC GflC RTO RTC GG6 CRG RTC RAC 6CC 66R RAC CCG GGC ATC CCA 
GRG GGC CTC GGC CRG TGG CTG TBC TUB CCC GTC TR6 TTO COO CCT TTG GGC CCG TBG CGT 
Leu Pro Glu Pro Uol Thr Rtp Ntt in Gly Gin Ho Rtn Rla Oiy flan Pro Gly He flier* 

* « * " « « miTflTBBMCI HT1IM Q? CC2-8 fl fl 0—4 0 B—* 



5/7 

1230 1240 1 250 1260 1270 1280 

CCO GCH TTC HOC CflC flTC BTG TBC BCC RCC flCC GTT 6CC BOB CTC GCC CCS BCC TTC TO 
CCC COT RRG TCC GTC TRG TflC BTG CGO TOO TOO CRfl CSC CCT GBG CGG CGC TOG RHG AGS 

pro flia Pho flrg Bin lit wt Tyr Bio Thr Thr Ual Bla Gig Leu filft bib Tnr Phe ser* 

f » " - - • BirrmmnMOttTinM OF C02_8 9 fl fl 11 n n * 

1290 1900 1310 1320 1330 1340 

»••••••«•••• 

BBC GRC CTO TOO 6GB TGG TCC BBS SBC GTC CBO TTC TBC BTC CfiG GCC BCC BCC CTT COT 
TTB CTB OBC BCC CCT BCC BB8 TTC CTO CBO GTC BBC BTG TBC CCC COG TGG TGG GBfl GCB 
ten Rip Lou Trp Sly Trp Mr Lyt flap Ual Gin Phe Tyr lla firg HI a Thr Thr Leu flrg* 
niiTnTnmjtiftrifM flp C02-8 — a — a — a — a — a — a — * 

1350 1350 1370 13B0 1390 1400 

,*«**••••*** 

CTO BCC GBG CGC GGC GCB GCC GTC HTC BCC TCC CGC GCC RHC HTC GGC CBG OTC BTC CflC 
GRC TGG CTC CCG CCS CCT CGC CBG TUG TGG RGB 6CG CGG TTG TBG CCG OTC CBS TBG GTG 
Leu Thr Glu Gig Gly Gty Rla Uol lis Thr Sor Brg Bio ten lie Gly Gin Ual lie HI«-» 

" BltTnTBFMglBTlIN OF C02_B 8. 8 fl 0 8 8 * 

HiO 1420 1430 1440 1480 1460 

GBC TTC BCC CBO TGG TTC BBC GGC CGC BTG GBG TBC TBC CGC TCT BTC GGfl CBG TTC CCC 
CTG BRG TGG GTC BCC BBS TTG CCO OCB TBC CTC BTG BTG GCG BOB TBO CCT GTC BB6 GGG 
Btp Phe Thr Gin Trp Phe ten Qly Brg Mit Glu Tar Tyr Brg Ser lit Gly Gin Pha Pro* 

- - » *i • m BHTflTIBflMS L AT 1 fN OF C02-8— 8 B 0 8 0 tt— .* 

1470 1400 1490 1500 1510 1S20 

* . ,«'•.«• • * • • • 

CTC BBC GGC CCC GTC GBfl BTC CGT TGC TCC GGC CTC GBT CAR CCC TCC GBC STC ORG GTC 
OBG TTG CCG GGG CBG CTT TBO GCB BCD BCG CCG GBG CTB GTT GGG RGC CTG CBG CTC CBG 

Lou ten Gly Pro Uol Glu lla Brg cya cys Gly Lou Bap Gin Pro sor ftp Ual Glu Ual* 

f . » » » » B IITnTHflNSLBTUN OF 012-8 0 .. ft B f> 8 8—"* 

1530 1340 1S50 1560 1370 15B0 

»••••••••*■• 

GBC TCG OCB CGC CCC CCC RCC BTC TCG GCC BTG CGC CCO CGC CCB 6RC CAT CCG GHB TGG 
CTG BGC CGT CCG GGG GGG TGO TBO ROC CGG TBC GCG GGC GCG 6GT CTG OTB GGC CTT BCC 
tep ser Bio Gly Pro Pro Thr lla Ser Bio Mrt Brg Pro Brg Pro ftp Hl» Pro Glu Trp* 
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GBC BCC GCC BTC TGG CTC BBC BTC CTC CGC GTC CCC G6B BCC CTC GGC BTG TTC GCG TTC 
CTG TGG CGO TBG BCC GBG TTG CBG BBS CCG CBG GGG CCT TGG 66G CCG TBC BAG CGC BRG 
ftp Thr Rla lis Trp Leu ten Ual Lau Gly Ual Pro Gly Thr Pro Gly N*t Phe flia Pho* 
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BTG GCG CTT TflC CTT CTC HCC TflC GGfl TTB CT5 BTG TTG TTG nG CTG CGO TGS HBO GCG 
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CCB Gflfl TGG TCC BAG GGC TGG CGT TEC CCC CCG BCB BSC CGT BCB CCG BCB CCC CGB TCB 
G5T CTT BCC BCD TTC CCG BCC GCB BGC CGO GGC TGT TCG GCB TGT GGC TGC GG6 GCT BGT 
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TCB CCC B68 CCC TCC CGC BGB CCT BCC GCG BC8 GCG TCC CBT CGB DCS BCB BCT 6GG BCB 
BGT GG6 TCC CGG BGG GCG TCT 6GB TGS CGC TGC CGC BGG GTB GCT CGC TGT TGB CCC TGT 
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CCG CCB BCG CCG CAT BCB BCG CGT TGS BTC CGC BCB BGG TCT TCB GCB BCB CCT TCC TGG 
GGC GOT TGC GGC 6TB T6T TGC GCB BCC TAG GCG TGT TCC BGB BCT CGT TGT BGB BGB BCC 
pro Pro Tnr Pro Hit Thr Thr flrg Trp No Arg Thr Arg s«r sor flla Thr Pro Sor Trp* 
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GCC BCT CCC CTB CTT TCG GGB GTG CGG CCG CCG TCG TGC GTT GCB GTB BGC CGG ATG BAG 
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Bla Thr Pro Leu Lou Sar Gig Ual Brg Pro Pro sor Cgt Ual Bla Ual Ser Gly N*t Lyt* 

_ m » m n . m A IITflTHRNSLflTI tN OF C02-8 8 8 8 fl 8 8 * 



2010 2020 2030 2040 2050 2060 2070 

. • . • - • ■ 

GHG TCC CCC CGC BTC CCG BCC BGG TAB C TGBCTCBATC CCGHTCBGGA JGGHTCGCCT JSTGCBTBCT 

CTC BGG GGG GCG TAG GGC TGG TCC BTT G BCTGRGTTRG CGCTBOTCCT ACCTA6CGGA GCBCGTBTGB 
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TTCCBCGGCC CTGGCBTGBG TTGflGBBCTC GEflCCGBGCC GBCCGBBTC8 TGGCGCTGTC GCTGCGCBCC 

RRGGTGCCGG CBCCGTBCTC RRCTCTCGBG CCTGGCTCGG CT6GCTTRGC BCCGCOBCflO CCBCGC6CG0 
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TGCGCTCTTC CCTCTCC8C0 COBGCGCGCB BCBBC6BCTC GGHCTTCGGG GTBTTTCGCB TCGGRGCRTT 
RCGCGBGIUIG GGBGBGGCGC GCCCGCGCGT TOTTGCTGBB CCTGRRGCCC CBTBBBGCGT BGCCTCGTHB 
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BTCCCHGRTT CBTCTCCCBT GBCTCBCBTG GBGC6CTCTC BCCCCCGGTG BCCBBBCC 

CRGGGTCTRR BTRBRB6GTR CTGflGTGTBC CTCGCBHGRG TGGBGGCCBC T60TCC88 
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